Multi-objective multiagent credit assignment in reinforcement learning and NSGA-II

نویسندگان

  • Logan Michael Yliniemi
  • Kagan Tumer
چکیده

Multiagent systems have had a powerful impact on the real world. Many of the systems it studies (air traffic, satellite coordination, rover exploration) are inherently multi-objective, but they are often treated as single-objective problems within the research. A key concept within multiagent systems is that of credit assignment: quantifying an individual agent’s impact on the overall system performance. In this work we extend the concept of credit assignment into multi-objective problems, broadening the traditional multiagent learning framework to account for multiple objectives. We apply credit assignment through difference evaluations to two different policy selection paradigms to demonstrate the broad applicability of the proposed approach. We first examine reinforcement learning, in which we improve performance by (i) increasing learning speed by up to 10x (ii) reducing sensitivity to unmodeled disturbances by up to 98.4% and (iii) producing solutions that dominate all solutions discovered by a traditional team-based credit assignment schema. We then examine a state-of-the-art multi-objective evolutionary algorithm, NSGA-II. We derive multiple methods for incorporating difference evaluations into the NSGA-II framework. Median performance of the NSGA-II considering credit assignment dominates best-case performance of NSGA-II not considering credit assignment in a multiagent multiobjective problem. Our results strongly suggest that in a multiagent multi-objective problem, proper credit assignment is at least as important to performance as the choice of multi-objective algorithm. This work was partially supported by the National Energy Technology Laboratory under grant DEFE0012302. Logan Yliniemi University of Nevada, Reno E-mail: [email protected] Kagan Tumer Oregon State University E-mail: [email protected] 2 Logan Yliniemi, Kagan Tumer

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Objective Multiagent Credit Assignment in NSGA-II Using Difference Evaluations

Determining the contribution of an agent to a system-level objective function (credit assignment) is a key area of research in cooperative multiagent systems. Multi-objective optimization is a growing area of research, though mostly focused on single agent settings. Many real-world problems are multiagent and multi-objective, (e.g., air traffic management, scheduling observations across multipl...

متن کامل

Multi-objective Multiagent Credit Assignment Through Difference Rewards in Reinforcement Learning

Multiagent systems have had a powerful impact on the real world. Many of the systems it studies (air traffic, satellite coordination, rover exploration) are inherently multi-objective, but they are often treated as single-objective problems within the research. A very important concept within multiagent systems is that of credit assignment: clearly quantifying an individual agent’s impact on th...

متن کامل

Multiagent Credit Assignment in a Team of Cooperative Q-Learning Agents with a Parallel Task

Traditionally in many multiagent reinforcement learning researches, qualifying each individual agent’s behavior is responsibility of environment’s critic. However, in most practical cases, critic is not completely aware of effects of all agents’ actions on the team performance. Using agents’ learning history, it is possible to judge the correctness of their actions. To do so, we use team common...

متن کامل

Using communication to reduce locality in distributed multiagent learning

This paper attempts to bridge the elds of machine learning, robotics, and distributed AI. It discusses the use of communication in reducing the undesirable eeects of locality in fully distributed multi-agent systems with multiple agents/robots learning in parallel while interacting with each other. Two key problems, hidden state and credit assignment, are addressed by applying local undirected ...

متن کامل

Reinforcement learning for multi-step problems

In reinforcement learning for multi-step problems, the sparse nature of the feedback aggravates the difficulty of learning to perform. This paper explores the use of a reinforcement learning architecture, leading to a discussion of reinforcement learning in terms of feature abstraction, credit-assignment, and temporal-difference learning. Issues discussed include: the conditioning of the reinfo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Soft Comput.

دوره 20  شماره 

صفحات  -

تاریخ انتشار 2016